Community Detection via Semidefinite Relaxation
Abstract
Notes for a lecture given by the author on November 7, 2014, as part of the special course "Randomness, Matrices and High Dimensional Problems" at IMPA, Rio de Janeiro, Brazil. The results presented in these notes are from [1].

1. The problem we will focus on

Let $n$ be an even positive integer. Given two sets of $\frac{n}{2}$ nodes each, consider the following random graph $G$: for each pair $(i, j)$ of nodes, $(i, j)$ is an edge of $G$ with probability $p$ if $i$ and $j$ are in the same set, and with probability $q$ if they are in different sets. Each edge is drawn independently, and $p > q$. (Think of the nodes as fans of Fluminense and Flamengo and of the edges as friendships; in this model, fans of the same club are more likely to be friends.) For which values of $p$ and $q$ can we recover the partition, with high probability, using an efficient algorithm that only looks at the graph $G$?

2. The interesting regime

If $p \ll \frac{\log n}{n}$, then it is easy to see that each cluster will not be connected (with high probability), and so recovery is not possible. In fact, the interesting regime is when
$$p = \frac{\alpha \log(n)}{n} \quad \text{and} \quad q = \frac{\beta \log(n)}{n}, \tag{2.1}$$
for constants $\alpha > \beta$. Let $A$ be the adjacency matrix of $G$, meaning that
$$A_{ij} = \begin{cases} 1 & \text{if } (i, j) \in E(G) \\ 0 & \text{otherwise.} \end{cases} \tag{2.2}$$
Let $x \in \mathbb{R}^n$ with $x_i = \pm 1$ represent a partition (note that there is an ambiguity, in the sense that $x$ and $-x$ represent the same partition). Then, if we did not worry about efficiency, our guess (which corresponds to the Maximum Likelihood Estimator) would be the solution of
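To make the random graph model concrete, here is a minimal Python sketch that samples $G$ in the regime (2.1) and returns the adjacency matrix $A$ of (2.2) together with a label vector $x$. The function name `sample_two_community_graph`, the use of NumPy, and the choice of placing the first $n/2$ nodes in one set are illustrative assumptions, not part of the notes.

```python
import numpy as np

def sample_two_community_graph(n, alpha, beta, seed=None):
    """Sample the two-set random graph with p = alpha*log(n)/n, q = beta*log(n)/n.

    Hypothetical helper for illustration only. Returns (A, x) where A is the
    symmetric 0/1 adjacency matrix and x is the +/-1 label vector.
    """
    if n <= 0 or n % 2 != 0:
        raise ValueError("n must be an even positive integer")
    p = alpha * np.log(n) / n
    q = beta * np.log(n) / n
    if not (0.0 <= q < p <= 1.0):
        raise ValueError("need 0 <= q < p <= 1 for valid edge probabilities")

    rng = np.random.default_rng(seed)

    # Assumption: the first n/2 nodes form one set (+1), the rest the other (-1).
    x = np.ones(n, dtype=int)
    x[n // 2:] = -1

    # Edge probability is p inside a set and q across sets.
    same_set = x[:, None] == x[None, :]
    probs = np.where(same_set, p, q)

    # Draw each pair i < j independently, then symmetrize (no self-loops).
    upper = np.triu(rng.random((n, n)) < probs, k=1)
    A = (upper | upper.T).astype(int)
    return A, x

# Example: n = 200 nodes with alpha = 5 and beta = 1.
A, x = sample_two_community_graph(200, alpha=5.0, beta=1.0, seed=0)
```

Only the strict upper triangle is sampled, so each unordered pair of nodes is drawn exactly once before the matrix is mirrored.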
Similar resources
Performance of a community detection algorithm based on semidefinite programming
The problem of detecting communities in a graph is maybe one of the most studied inference problems, given its simplicity and widespread diffusion among several disciplines. A very common benchmark for this problem is the stochastic block model or planted partition problem, where a phase transition takes place in the detection of the planted partition by changing the signal-to-noise ratio. Optimal...
A semidefinite relaxation scheme for quadratically constrained
Semidefinite optimization relaxations are among the widely used approaches to find global optimal or approximate solutions for many nonconvex problems. Here, we consider a specific quadratically constrained quadratic problem with an additional linear constraint. We prove that under certain conditions the semidefinite relaxation approach enables us to find a global optimal solution of the unde...
Semidefinite relaxation for dominating set
It is a well-known fact that finding a minimum dominating set and consequently the domination number of a general graph is an NP-complete problem. In this paper, we first model it as a nonlinear binary optimization problem and then extract two closely related semidefinite relaxations. For each of these relaxations, different rounding algorithm is exp...
Community detection in sparse networks via Grothendieck's inequality
We present a simple and flexible method to prove consistency of semidefinite optimization problems on random graphs. The method is based on Grothendieck’s inequality. Unlike the previous uses of this inequality that lead to constant relative accuracy, we achieve any given relative accuracy by leveraging randomness. We illustrate the method with the problem of community detection in sparse netwo...
The application of semidefinite programming for detection in CDMA
In this paper, a detection strategy based on a semidefinite relaxation of the CDMA maximum-likelihood (ML) problem is investigated. Cutting planes are introduced to strengthen the approximation. The semidefinite program arising from the relaxation can be solved efficiently using interior point methods. These interior point methods have polynomial computational complexity in the number of users....
Orientation Determination from Cryo-EM images Using Least Unsquared Deviation
A major challenge in single particle reconstruction from cryo-electron microscopy is to establish a reliable ab-initio three-dimensional model using two-dimensional projection images with unknown orientations. Common-lines based methods estimate the orientations without additional geometric information. However, such methods fail when the detection rate of common-lines is too low due to the hig...